Load Shedding in Data Stream Systems

Authors

  • Brian Babcock
  • Mayur Datar
  • Rajeev Motwani
Abstract

Systems for processing continuous monitoring queries over data streams must be adaptive because data streams are often bursty and data characteristics may vary over time. In this chapter, we focus on one particular type of adaptivity: the ability to gracefully degrade performance via "load shedding" (dropping unprocessed tuples to reduce system load) when the demands placed on the system cannot be met in full given available resources. Focusing on aggregation queries, we present algorithms that determine at which points in a query plan load shedding should be performed and how much load should be shed at each point, so as to minimize the inaccuracy introduced into query answers. We also discuss load shedding strategies for other types of queries (set-valued queries, join queries, and classification queries).
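As a rough illustration of what shedding load ahead of an aggregation query can look like, the sketch below drops tuples uniformly at random and scales the surviving sum to compensate. It is a minimal example under stated assumptions, not the chapter's algorithm: the abstract does not specify the mechanism, and the function name `shed_and_estimate_sum` and the parameters `keep_rate` and `seed` are hypothetical names introduced here.

```python
import random


def shed_and_estimate_sum(values, keep_rate, seed=None):
    """Uniform random load shedding in front of a SUM aggregate.

    Assumption (not from the source): each tuple is kept independently
    with probability keep_rate, and the sum of the kept tuples is divided
    by keep_rate so that the estimate is unbiased.
    """
    if not 0.0 < keep_rate <= 1.0:
        raise ValueError("keep_rate must be in (0, 1]")
    rng = random.Random(seed)
    # rng.random() is in [0, 1), so keep_rate == 1.0 keeps every tuple.
    kept = [v for v in values if rng.random() < keep_rate]
    estimate = sum(kept) / keep_rate
    return estimate, len(kept)


if __name__ == "__main__":
    stream = [1] * 100_000
    estimate, processed = shed_and_estimate_sum(stream, keep_rate=0.5, seed=0)
    print(f"processed {processed} of {len(stream)} tuples, estimated sum {estimate:.0f}")
```

A lower `keep_rate` reduces the number of tuples the aggregate has to process at the cost of a noisier estimate. Choosing where in a query plan to place such a sampling step, and at what rate, is the problem the abstract describes.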


Similar resources

How to Screen a Data Stream - Quality-Driven Load Shedding in Sensor Data Streams

As most data stream sources exhibit bursty data rates, data stream management systems must recurrently cope with load spikes that exceed the average workload to a considerable degree. To guarantee low-latency processing results, load has to be shed from the stream, when data rates overstress system resources. There exist numerous load shedding strategies to delete excess data. However, the cons...


A Framework For Supporting Load Shedding in Data Stream Management Systems

The arrival rate of tuples in a data stream can be unpredictable and bursty. Many stream-based applications have Quality of Service (QoS) requirements that need to be satisfied by the underlying stream processing system. In order to avoid violating predefined QoS requirements during temporary overload periods, a load shedding strategy is necessary and critical for a data stream management syste...


Rate-Sensitive Load Shedding in Data Stream Systems

Traditional load shedding algorithms for data stream systems calculate current operator selectivity over several run periods and use them to determine where to shed load during the next run period. In this paper, we show that the current selectivity may change due to the implementation of load shedding. Our algorithm, called RLS, determines the optimum drop location by these changed selectivity...


Load Shedding Techniques for Data Stream Systems

Many data stream sources (communication network traffic, HTTP requests, etc.) are prone to dramatic spikes in volume. Because peak load during a spike can be orders of magnitude higher than typical loads, fully provisioning a data stream monitoring system to handle the peak load is generally impractical. Therefore, it is important for systems processing continuous monitoring queries over data s...


Content-based Load Shedding in Multimedia Data Stream Management System

Overload management has become very important in public safety systems that analyse high performance multimedia data streams, especially in the case of detection of terrorist and criminal dangers. Efficient overload management improves the accuracy of automatic identification of persons suspected of terrorist or criminal activity without requiring interaction with them. We argue that in order t...


SOSA: A Safe Load Shedding Approach for Monitoring Data Streams in Real-Time

Real-time stream processing is essential for many real-life stream-based applications. Systems designed to run such applications must be prepared to operate under overloaded conditions. Existing load shedding techniques are not suitable for processing data streams with stringent timing constraints because their tuple dropping policies may violate application deadlines in an uncontrolled way. To...




Publication date: 2007